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PROCESS FOR MAKING GENES ENCODING RANDOM POLYMERS OF AMINO ACIDS 


Background of the Invention 


Copolymer 1 (COP-1) is a synthetic polypeptide analog of myelin basic protein (MBP) which is a 
natural component of the myelin sheath. It has been suggested as a potential therapeufc agent for multiple 
sde^isTEur J- Immunol. [1971] 1:242; and J. Neurol. Sci. [1977] 31:433). '"tarest in COP-1 as an 
rmm^oUrapy for multiple sclerosis stems from observations first made in the that myehn 

wZonents such as MBP prevent or arrest experimental autoimmune encephalomyel.tis (EAE). EAE .s a 
disease resembling multiple sclerosis that can be induced in susceptible animals. 

COP™ was developed by Drs. Se.a. Amon, and their co-workers at the Weizmann Institute (Rehovot. 
Israeli It was shown to suppress experimental allergic encephalomyelitis (EAE) (Eur. J- Immunol. [1971] 
SSki US Patent Na 3 849.550°. More recently. COP-1 was shown to be beneficial for patients with 
^ exacerbating-remitHng form of multiple sclerosis (N. Eng.. J. Med. [1987] 317:408-41 4). 1MM 
witt dally injections of COP-1 had fewer exacerbations and smaller increases .n their Usability status than 

th ° COFM ?s aU a e mTxture of polypeptides composed of alanine, glutamic acid, lysine, and tyrosine In a molar 
ratio of approximately 6:2:5:1. respectively. It is synthesized by chemically polymerizing the four ammo 
addsfomTng products with average molecular weights of 23.000 daltons. Although the resuting polypep- 
tides a°e comprised of the same amino acid components, they differ with respect to their ammo acd 
sequences in fact, there are 10'°° possible ways to assemble a 23.000 dalton polypept.de composed o 
Sne glutamic acid, lysine and tyrosine in the designated ratios Purification of one or even a small 
number of distinct COP-1 polypeptides from chemically-synthesized COP-1 is not possible. 

Studies evaluating COP-1's efficacy have been hindered somewhat by incons stent batches of COP-1. 
Also Tis not known which of the amino acid P olymer(s) is responsible for the b,olog.cal activity of COP-1. 
oteV random sequence amino acid copolymers re.ated to COP-1 have been "^^^^ 
tested for the ability to suppress experimental allergic encephalomyelrtis (Eur. J. Immunol. [1973] 3.273. 
mmunochrmistry *976] 13:333). Biological activity was observed in EAE assays us.ng COP-1 related 
powers in whS, one of the following changes occurs: tyrosine is replaced by tryptophan: or glutamic acd 
is reolaced bv aspartic acid; or tyrosine is excluded. ■_,«.« 

We have developed procedures for synthesizing genes encoding polypeptides composed of specific 
amino acids, but having random amino acid sequences. The amino add composition of the po YPeptides is 
dSedby ihe set of codons incorporated in the synthetic genes. Likewise, the size of the polypeptides is 
controlled by synthesizing genes of specific lengths. 


Brief Summary of the Invention 


The subject' invention concerns' a method for synthesizing genes which encode random Polymers of 
amino acids. A further aspect of the invention is the identification of certain polypeptides which i are 
expressed by the synthetic genes and which have high levels of biological activity. The general method- 
ology of the subject invention is outlined in Figure 1. „ rf „«„„ „. « ma ii 

A critical step in the novel process of the subject Invention is the polymenzation of small 
oligonuSTSde duplexes. Preferably, the oligonucleotide duplexes consist of a multiple of *ree nucleotides. 
Tne tenoi of the synthetic genes can be controlled through the use of adaptors speafic for the 5 and 3 
Ss Se oligonucleotides. Further, the composition of the resultant polypeptides can be varied with 
reTpecV to the relative proportions of the amino acid constituents, by varying the proportions of input 

^ThtSeslsTpSpeptides similar to COP-1 exempts the procedures of the subject invention^ The 
initiaTSep procedure is the synthesis of genes which cod for polypeptides consisting I °fpredeter- 
mmed lino acid constituents. These genes are then cloned in an expression vector and Introduced Into E. 
cTsucTS each r comblnant bacterial colony contains on COP-1 gen . T generat a mixUir _ of COP-1 
gfypeptides (analogous to the chemically synthesiz d product) we produce COP-1 polypeptides from a 
pool of recombinant bacterial colonies containing COP-1 gene sequenc s. e.g.. 1000 colon.es. 
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The efficacy of the pool of recombinant COP-1 polypeptides Is tested in /Pimento! a lergic 
encepTalornyeSs (EAE) assays. If effective, the pool of colonies exhibiting acfivty ,s farther subdivided 
LT SooTof 100 f colonies), and the polypeptides from these smaller pools are tested. By sequenbaJly 
factioSating and se.ecting the most active pools, we identify individual recombinant COP-1 Polypeptdes or 
S oroujs of polypeptides with biological activity in EAE assays equal to or h,gher than chermcaJly 
SeS COP-1. The opportunity to characterize homogeneous, individual COP-1 po.ypept.des ,s unique 

to *The P XS invention also concerns the synthetic genes and the poiypeptides produced by the 
meth^s dSed he^n. Advantageously, the procedures of the subject invention can be used to produce 
cXoTotides which may be useful in preventing, arresting, or controlling demyel.nat.ng disorders such as 
^Efscierosis A preferred copolymer according to the subject invention cons.sts substantially of 
Sne lys nHd JL glutamicVaspartic acid. Polymers of any length or -o'ecu.ar wdght [ can be 
svXsized using the procedures of the subject invention. Another preferred copolymer further includes 
eCr Cosine or tryptophan. More specifically, a preferred copolymer may consist of alan me lysine 
gMamic aSd a^d Cosine, and have a mo.ecular weight between about 5.000 and 50,000 daltons. Further, 
the method of the subject invention can also be used to make fusion proteins. 


Brief Description of the Drawings 


Figure 1 depicts the general methodology for synthesizing random genes and identifying polypep- 
tides wnh bjolc f*£% spec . fic strategy fof ^e^g genes encoding random-sequence polypep- 

25 tides. 3 poss!We 3 amino combjnations ^ their per cent occurrence In COP-1. 

Figure 4 shows 9-nucleotide duplexes and adaptors for COP-1 gene synthesis. 
Fiaure 5 shows the construction and cloning of synthetic COP-1 genes. K «^»»o 
Figure 6 provides the sequence analysis of one synthetic gene reveaiing proper junctions between 
30 duplexes 7 ^ a westem b£ot ^^.^ prote . n produced by four efferent clones. 

Figure 8 shows variations of random-gene synthesis using small duplexes. nnl „ mars 
Fiaure 9 shows synthesis of single-stranded DNA encoding random sequence ammo acd P^^s. 
5S« « I shows a p^osphoramidite trinucieotide for random gene synthesis using a ONA syn- 

35 thesizer. . _ H __ 

Figure 11 shows the DNA and amino acid sequences Of rCOP-1-77. 
Floure 12 shows the DNA and amino acids sequences of rCOP-1-19. 

Figure 13 shows the average EAE scores of disease-induced guinea pigs wh.ch were untreated or 
treated with myelin basic protein. rCOP-1-19. or rCOP-1-77. 

Detailed Description of the Invention 

One strateov for synthesizing genes encoding random-sequence polypeptides is outlined in Figure 2. 
The°xamtleTst«te7Sa synthesis of genes composed of two DNA duplexes. Oligonucleotides cornpns- 
^ tnTd^otexes^ synthestoed and annealed. Each DNA duplex has the same "sticky ends- represented 
v^ LSTi S duplexes are mixed together, sticky ends align, and the ends are Joined In an 
tlZZtic re^ctSS producing long segments of DNA (genes). Since the sticky ends on each duplex are the 
sSTfre duSfxe f2 Tang, and Hgate In any order. Thus, a series of genes composed of the sarne 
d^LTbut t va^ing orders are produced. The polypeptides encoded by these genes w,.l have s.m.lar 

prevr U sl7toVoniruct genes encoding specific proteins. Rrst, random sequence genes " ^<ta"" d * 
««mbnL small oligonucleotide duplexes because more sequence variation .s produced by mnong the 
order TJSZZ Earg* duplexes. In contrast, long oligomers of 30 to 60 nucJ otides are employed 
Z slnthe^ze a gVn e ^coSfl a defined amino dd sequence. Secondly, to construct random-sequence 
™ S *e ^ "«cal s that the duplexes can b Join d together In any 

Srder For Responding todefined amino acid sequences, the duplexes must ligate in a fixed rder. 


40 


45 


SO 


55 


3 


EP 0 383 620 A2 


To achieve this ordering, the sticky ends at each Junction must be unique. Thus, the process of the subject 
invention is unique in that random-sequence genes are synthesized using oligonucleotide duplexes 
encoding small segments of amino acids, and the sticky ends on each duplex are the same. 

Our procedure for synthesizing genes encoding recombinant COP-1 polypeptides entails using 

s oliqonucleotide duplexes encoding segments of 3 amino acids. All possible permutations of the 3 amino 
acid segments comprised of the four COP-1 amino acids and their percent occurrences in COP-1 are 
shown in Figure 3. To make recombinant COP-1 genes we have synthesized oligonucleotides correspond- 
ing to the coding and noncoding strands for some of the 3 amino acid segments (Figure 4). The 
oligonucleotides are phosphorylated at the 5' ends. Complementary pairs of oligonucleotides are annealed. 

,o forming duplexes with the 3 nucleotide extending on each strand: adenosine on the coding strand, and 
thymidine on the noncoding strand. Adenosine and thymidine base pair with one another thus ensuring that 
the duplexes are joined directionally. that Is. coding strands to other coding strands. Since the duplexes 
have the same nucleotide extensions, they can align In any order. When duplexes corresponding to all 
sixty-four 3-amino acid blocks are mixed and ligated. COP-1 genes with all possible sequences are 

,s P rodu ° e < d 3 - ntro| ^ |ength of me syntnetic genes, we have included adaptors specific for the 5 and 3' ends. 
One strand of each duplex adaptor is not phosphorylated (see F.gure 4). As a result, ligation products 
terminate at the hydroxylated ends. By varying the ratio of the adaptor duplexes in the reaction, we can 
control the length of the synthetic genes. The adaptors serve the second function of adding specific 

20 extensions required to directionally clone the ligation products into the vector (F.gure 5) The relative 
proportions of amino acid constituents in the random polymers can also be modulated by mixing the 
oligonucleotide duplexes In different ratios. For example, duplexes can be added to ligations in proportions 
dictated by the percent occurrence of the corresponding amino acid segments in COP-1 (Figure 3). 

The number of different duplexes incorporated determines the sequence complexity of the synthetic 

25 oenes For example. COP-1 genes can be constructed using fewer than 64 duplexes but not all sequence 
combinations will occur. The amount of sequence complexity required will depend on the application of the 

P °' y One t complication arises In producing completely random COP-1 amino acid sequences. For duplexes 
to have the same extensions, the 3* nucleotide on the coding strands must be tiw same. Codons for 
3 o alanine, glutamic acid, and lysine can end with adenosine; however, for tyrosine, the last^ucleotide is erihnr 
cytidine or uridine. Thus, duplexes encoding three amino acid segments ending in tyrosine (fourth column 
onRaure 3) will have different extensions than the other duplexes. This limitation can be overcome by 
making a second noncoding strand for each duplex with an extension of guanosine or adenosine. Because 
this solution requires significantly more DNA synthesis, we have elected to exclude duplexes corresponding 
as to three amino acid segments ending in tyrosine. ^^.^h , n 

Once the synthetic genes are made, they are cloned into a suitable expression vector and ^starred to 
a host capable of expressing the polypeptides. The host may be bacterial or eukaryotic cells. With bacteria, 
cells are grown under conditions that permit formation of recombinant colonies. Each colony will contain 
and express one synthetic gene. The polypeptides expressed by culture from specific colonies are Isolated 
and tested for the relevant biological or immunological activity. For example. COP-1 polypeptides are tested 

ToSreena'aTgTnlm^of polypeptides for activity, it may be advantageous to pool the polypeptides 
before testing. Pools of polypeptides can be generated by either combining colonies and isolating the 
polypeptides produced by the mixed culture or by culturing individual colonies, isolating the po ypeptides 
from each culture, and then combining the purified polypeptides. By testing pools of polypeptides, it is 
possible to more quickly determine which of the colonies express biologically or .rnmunologically active 
pofy7eptia-es. For example. H polypeptides from 100 colonies do not exhibit activity. ^ "^T^ 1 * 
eUminated from further investigation. If. on the other hand, the mixture of polypeptides does exhib.1 Mthe 
S acS. then sequent subsets of the pooled polypeptides can be tested until the active 
so polypeptides, or mixture of polypeptides, is Identified. „i !mI „ a «„„ «,« 

Tests fo some biological properties may be performed directly on the colon.es. thus eliminating the 
~ ne J for isolating polypeptides For example, reactivity of polypeptides with antisera can be assessed in 
colonies grown and lysed on nitrocellulose filters. 

65 Materials and Methods 
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S ynthesis and Phosphorylation of Oligonucleotides. The cod.ng an * "orwodma oj* rJS? D £ 
each^pTiTareTy nthesized by the phosphite triester method with an Applied B.osystems Modal WDNA 
svn*esizer The 5 ends of oligonucleotides are phosphorylated on the DMA synthes.zer us.ng (2-t^4.4 - 
dimethoxytrityloxy)ethyisulfonyl)ethyK2-cyanoethyr)N.N-diisopropyl) phosphoramidite from Glen Research. 


Purfflcation of Oligonucleotides. The phosphorylated form of the oligonucleotide ,s separated from tte 
crudTSfe-br eleclophoresis th rough a 20% acrylamide gel containing 7 M urea. Oligomers are e luted 
torn excSed gi slices and desalted on Sep-Pak C-18 cartridges. Separation of the S phosphorylated 
oTgomer from hydroxylated forms is critical because hydroxylated oligomer causes the ligation reactions to 

ter rnX^^o? On^clg ^lxtures containing , nmo, each a, a codlr^and com- 
Annealing ana "9 a "° ■ « Tris-H CI pH 7.6/ and 0.1 mM ethylenediaminetetraacetic acid 

S^J&^hr-«^. 600 m,' water ha.. The reactions .are « *«- 
slow v (2 hours) to room temperature. The temperature is decreased to 4 C over one hour by add.n pee to 
SritSST For Ration, equal aliquots of the annealed duplexes are mixed and the solution ,s ad.usted 
S cTntSn TmM MgCI,. mM adenosine triphosphate (ATP). 1 mM DTT. and 15% polyethytene glycol. 
^rf?nrcon^ation o Tris-HCI is 66 mM (from the annealing reactions). The total concentrators , of 9- 
mer ^IZ^TSZlSLmr. Annealed adaptors are included in the reactions at a concentration 
Ttf wCoduce ^gationTrSucts of the correct size and with termini that are compatible wAh s.tes m the 
Sg vector. To obtain a maximum yield of synthetic genes within the^ corrcc 

r « ~^«r> n * *r« «t un with increasing ratios of adaptor duplex:9-mer duplex (e.g.. 1.50. 1.150, l.^UU). 
The "actons ^yfe.d ZftZSfiZTtoi* size range' are pursued. We have determined IM adding 1 
I™i ai, itch adaptor duplex to 50-300 pmol of 9-mer duplexes yields products of 400-600 base patrs. 
Son reasons a« in ?5 m cro.iters Jmi 600 units of T4 DNA ligase (New England Biolabs. Beverly. 
S? ™I Son exceeds at 16* C for 16-20 hours. After ligation, polyethylene glycol is removed by 
eZ2l 3S1 ?£ZZ ? of cnforSorm. The products are concentrated by ethano. precipitin and 

^Te^ion option Product, The concentrated reaction products are e.ecttophoresed on « 

SSS3S33E3£5i«5s£g 

genTs o? approximately this size because COP-1 polypeptides within this range were prevously tested in . 
Chemical trials. The agarose plug containing the synthetic DNA is stored at -20 C. 

Preparation of Expression Vector 

The oREV21 plasmid can be constructed from a plasmid pBQ1. Plasmid pBGI can be isolated I from 
-,c iT^hfst bv weti Known procedures, e.g. using cleared lysate-isopyenic density gradient procedures 
rts £ ™" ^eir was deposited in the E coli host MS371 with the Northern Regional Research 

L^Tatorv fSSL u rSp^rofAgri^ltureTp-io-ria. li.lnois. U.S.A) on November 1. 1984 and vvas 
Laboratory (NRRL, U.S. oe P anr " e "' « constructed from plasmid expression vector 

between pBG1 and pREV2£ are the following: 

V, *2L«»»M1 favor hwmoteeglar ligMlon (200 ul roacOon volume us.rio. SBiMart T4 ligaso 
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standard procedures. . „ . 

1c. The product plasmid. pBG1 N. where the 2160 base pair Ndel fragment is deleted from pBG1. 
was selected fay preparing plasmid from ampicillin resistant clones and determining the restriction digestion 
patterns with Ndel and Sail (product fragments approximately 1790 and 1650). This deletion inactivates the 
rop gene that controls plasmid replication. 

— 2a. 5 ug of pBG1 N was then digested with Eco RI and Bc1 l and the larger fragment approximately 
2455 base pairs, was Isolated. 

2b A synthetic double stranded fragment was prepared by the procedure of Itakura et al. (Itakura, K.. 
J J. Rossi, and R.B. Wallace [1984] Ann. Rev. Biochem. 53:323-356, and references therein) with the 
following structure: 

5' GATCAAGCTTCTGCAGTCGACGCATG 

3' TTCGAAGACGTCAGCTGCGTACGCCT 

AGGCCATGGGCCCTCGAGCTTAA 5' 


CG G ATCCG GTACCCG G G AG CTCG 3' 


This fragment has Bell and EcoRI sticky ends and contains recognition sequences for several restriction 

endonucleases. . . 

2c 0 1 ug of the 2455 base pair EcoRI-Bcl I fragment and 0.01 ug of the synthetic fragment were 
joined with T4 DNA figase and competent cells of strain JM103 were transformed. Cells harboring the 
recombinant plasmid, where the synthetic fragment was inserted into pBG1 N between the BeU and EcoRI 
sites, were selected by digestion of the plasmid with Hpal and EcoRI. The diagnostic fragment sizes are 
approximately 2355 and 200 base pairs. This plasmid is called pREV1. ^ 

2d. 5 ug of pREV1 were digested with Aatll, which cleaves uniquely. 

2e The following double-stranded frag ment was s ynthesized: 
5 CGGTACCAGCCCGCCTAATGAGCGGGC1 MINI IGACGT 3 # 
3' TGCAGCCATGGTCGGGCGGATTACTCGCCCGAAAAAAAAC 5 

This fragment has Aatll sticky ends and contains the frpA transcription termination sequence. 

2f. 0.1 ug ofAatll digested pREV1 was ligated with 0.01 ug of the synthetic fragment in a volume of 

20 ul using T4 DNA~Hgase. 

2g. Cells of strain JM103. made competent, were transformed and ampicillin resistant clones 

selected 

ai Using a Kpnl. EcoRI double restriction digest of plasmid isolated from selected colonies, a cell 
containing the corricTcoliitruction was isolated. The sizes of the Kpnl. EcoRI generated fragments are 
approximately 2475 and 80 base pairs. This plasmid is called pBEVITT and contains the trpA transenpton 

terminator.^ ^ ^ P rev 1TTi prepared as disclosed above (by standard methods) was cleaved with Ndel 
and Xmnl and the approximately 850"6ase pair fragment was Isolated. 

"~3b 5 ug of plasmid pBR325 (BRU Galthersburg. MD). which contains the genes confemng 
resistance to chloramphenicol as well as to ampicillin and tetracycline, was cleaved with Bel I andthe ends 
blunted with Klenow polymerase and dexoynucleotides. After inactivating the enzyme, the mature was 
treated with Ndel and the approximately 3185 base pair fragment was isolated. This fragment contains the 
genes for chloramphenicol and ampicillin resistance and the origin of replication. .• 

3c 0.1 ug of the Ndel-Xmnl fragment from pREVITT and the Ndel-Bcll fragment from pBR32S i were 
ligated in 20 ul with T4~DNA-iig~ase and the mixture used to transform competent cells of strain JM103. 
Cells resistant to both ampicillin and chloramphenicol were selected. 

3d Using an EcoRI and Ndel double digest of plasmid from selected clones, a plasmid was Mlaetad 
giving fragment sIzeTol approximately 2480. 1145. and 410 base pairs. This is called plasm.d pREV1TT/chl 
and has genes for resistance to both ampicillin and chloramphenicol. 

4a. The following double-stranded fragment was synthesized: _ 


6 


10 


2S 


30 


4S 


SO 


EP 0 383 620 A2 

Mlul EcoRV Clal BamHI 

5' CGAACGCGTGGCCGATATCATCGATGG 
3' GCTTGCGCACCGGCTATAGTAGCTACC 
Sail Hindin Smal 
ATCCGTCGACAAGCTTCCCGGGAGCT 3' 
TAGGCAGCTGTTCGAAGGGCCC 5' 


This fragment with a blunt end and an Sstl sticky end. contains recognition sequences for several 
restriction enryme JJJ^^ ^ ^ Nm , (wnich deaves ^ * nude otides from the Bel I 

16 site) and Sstl (which cleaves within the multiple cloning site). The larger fragment, approximately 3990 base 
p^. ^I^S^^SL, ^ pREV1TT/chl and 0.0! ug o, the synthase fragment were 

trpatsd with T4 DNA Hqase in a volume of 20 itl. , . . 

£ This L.L transformed .nto strain JM103 and amplciltin resistant Cones were - JcjA 

4e Plasmid was purified from several clones and screened by d.geston wrth MM or gaL 

Recombinant clones with to new multiple Coning site will give one fragment when d.gested w,th ether of 

fhfl«!a enzvmes. because each cleaves the plasmid once. 

these me , cloning site was vertfied . This was ^ne by restne^ng me ptonMd 

with Hpal and Pvu.l and isolating the 1395 base pair fragment, cloning it into the Smal s.te of mp18 and 
sequencing it bydideoxynucleotide sequencing using standard methods. 

4a. This plasmid is called pREV£2. „. 0 _, ri w „ 

PlaTmid pREV 2.2 can be isolated from Hs E. coll host by well known proced ures. This plasmW was 
deoosited in the E coli host JM103 with thr Northern Regional Research Laboratory (NRRL, U.S. 
DepSnt of^rifultuTe-, Peoria. Illinois. USA) on July 20, 1986 and was assigned^ access.cn number 

NR RasrnufpREV 2.1 was constructed using plasmid pREV 2.2 and a synthetic oligonucieotJde. An 

eX8mP ; e t^^S^SZZ Son eruymes Nrul and BamH, and the 4 Kb fragment Is 

M isolated from an agarose gel. ' . , 

2. The following double-strand oligonucleotide is synthesized: 

5' CGAACGCGTGGTCCGATATCATCGATG 3' 
3' GCTTGCGCACCAGGCTATAGTAGCTACCTAG 5' 


40 


3. The fragments from 1 and 2 are ligated in 20 ul using T4 DNA ligase. transformed into competent 

E coll cells and chloramphenicol resistant colonies are isolated. . .. r . „ frnm tha 

" —4. Plasmid clones are identified that contain the oligonucleotide from 2. spanning the regrcn frnm the 
Nrul site to the BamHI site and recreating these two restriction sites. Th.s plasmid te ™«* P^"' v , ald 
— Clonino tt 1 e - Lio"ation Products into an Expression Vector. The express.cn vector is digested to yield 

,s adju«ea ro to ^ h |ncubated ove m,g ht at 16 C. 

CompSrceils the ..gadon mixtures and recombinants are Identified by se.edon on 

.".strata procedures, .ncluding the best mode, for P****} 
55 .nveSon Sese examples should not be construed as limiting. All percentages are by we.ght and all 
solvent mixture proportions are by volume unless otherwise noted,. 
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Example 1 - Strategy for Synthesis of Random Sequence Genes 

Model studies using several oligonucleotide duplexes were performed to assess this method for 
synthesizing genes encoding polypeptides of predetermined amino acid composition but random se- 
s quences. The synthetic genes were analyzed with respect to size, ligation junctions, composition, sequence, 
and levels of expression. 

Size Synthetic genes within broad size ranges are produced by varying the ratio of adaptors to 9-mer 
duplexes. We are able to select genes within more limited size ranges by resolving the ligation products on 
agarose gels and excising gel slices containing products of a certain length. Using these procedures, genes 
to in the following three size ranges have been isolated: 75-150. 280-320. and 400-600 nucleotides. 

Ligation junctions. The synthetic genes have been sequenced to demonstrate that the 9-mer duplexes 
are joined end to end and without insertion or deletion of any nucleotides. Correct junctions are necessary 
for maintaining the reading frame and thus producing genes that encode polypeptides of the expected 
amino acid composition. Sequence analysis revealed that the junctions between the duplexes are correct 

75 (R9 ^ m 3 ^ osition> Tq CQntroI ^3 amin0 acid composition of the encoded polypeptides, the synthetic genes 
mu st contain th e duplexes added to the ligation reactions. The results from three gene synthesis 
experiments demonstrated that the synthesized genes are composed of the duplexes included in the input 
steps of the synthesis. 

Sequence. Since each duplex can ligate to any other, the order of the 9-mer duplexes in the synthetic 
gen es should be random. This randomness is demonstrated in the Example shown in Figure 6. The 
synthetic gene shown in this example is composed of duplexes encoding the following amino acid 
segments: KKA, EAE, KAK and YKK. 

Expression. To measure expression levels, synthetic genes are cloned into vectors such that the 
pol ypeptides a re expressed either as fusions with heterologous vector-derived peptide sequences, or as 
nonfusion polypeptides. The levels of expression of the fusion products can be readily measured by 
Western blot analysis using antisera directed against the vector-derived portion of the fusion protein. Figure 
7 demonstrates the expression of four COP-1 -containing fusion polypeptides. 

\ 

Example 2 - Variations for Synthesizing Random Sequence Genes with Small DNA Duplexes 

DNA duplexes of other lengths can be used in an approach similar to that described for duplexes of 9 
nucleotides. Since three nucleotides code for one amino acid, strands of 3, 6, 12. 15, or 1 8 nucleotides can 
be annealed forming duplexes that code for small blocks of amino acids (see Figure 8). More sequence 
variation occurs by mixing small rather than large duplexes. 

In addition, terminal extensions of more than one nucleotide can be employed. Figure 8 shows an 
example using duplexes of 12 nucleotides and extensions of 3 nucleotides. XXX represents any codon; the 
codons in the duplexes are varied to produce polypeptides of the desired amino acid composition. This 
approach restricts the polypeptide sequences since the amino acid encoded by the duplex junctions - 
alanine (Ala) in this example - is repeated every fourth amino acid. Also, to another variation of the subject 
invention, extensions of 5' nucleotides can be employed instead of the 3 extensions illustrated throughout 
this report 

Example 3 ■ Synthesis of Single-Stranded Random Sequence Genes 

Gene synthesis using DNA duplexes results in double stranded genes that are ready for cloning into an 
expression vector. An alternative strategy entails producing single-stranded, random sequence genes. Trie 
application of this method to COP-1 is illustrated in Figure 9. Three-nucleotide oligomers corresponding to 
codons for each of the amino acids in COP-1 are synthesized, mixed in appropriate ratios, and chemically 
polymerized in solution to produce long single-stranded COP-1 genes. The complementary strand of DNA 
is made enzymaticaJly using reverse transcriptase or DNA polymerase. The double-stranded DNA is 
prepared for cloning by digestion and repairing the ends of the molecules. 

Single-stranded, random sequence genes could also be made by performing the polymerization step on 
a DNA synthesizer. Typically, synthetic DNA is assembled on nucleotid at a time using phosphoramidite 
nucleotide precursors. We developed a strategy for synthesizing single-stranded DNA In three nucleotide 
segments <"codons n ) by using phosphoramidite trinucleotides (Figure 10). The use of 3-nucieotide buildtng 
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talccks instead of single nucleotides is necessary to ensure that only specific codons. those corresponding 
^ S p^spho^id'te trinucleotides. wou.d occur In the synthetic genes To test 
clrrTmissioned the custom synthesis of a phosphoramidite trinucleotide. We observed polymerization of the 
trinucleotide on the DNA synthesizer. 

Example 4 • Expression of Random Sequence Genes in Vector Producing Fusion Proteins 

Svnthetic random sequence genes can be cloned into gene fusion vectors so that the expressed 
rtJ32 are comprised of a vector-derived polypeptide linked to the random sequence P^ptid^ 
S SSond f this method to COM is described here. Synthetic COM genes are cloned Into the 
™ST^2«2d pREV 2.1 within the polyiinker site. Upon expression from P Rev 2.1 a 
SSI ^synthesized comprising the amincHermina. portion (approximately 25 to approx.mate.y 45 
poiyp« H / orotein linked by a peptide bond to the COP-1 polypeptide. 

"l^lS^SSiSS^ COM y ge^es P for expression in pRev 2, Fusion £»"*^<£ 
f ft ,.nd to be exoressed from ten of the twelve constructs. Figure 7 is a Western blot demonstrating the 
Slon croons ^ pSuced from four different clones, as detected by binding of antisera specrfic for the 
bacteria. Son oT'^e fusion protein. Detection of fusion proteins in the expected size range us.ng antsera 
s^Z -to, Te vector-derived portion Is dependent on the presence of a COM gene sequence smce the 

SI XroSLon po.ypep.ides comprised of 34 amino acids o, Je ve^erived 
orotSn TaPpr'ox rSately 3.900 daltons) and approximately 130-200 amino acids encoded by the random 
Sine ^gen« (15.000-23.000 daltons). Based on migration through SOS gels, the mo ecular weights of 
£ ? ftTsion pXpeptides are in the correct range. Cone 4 produces lower .evels of several smaller 
nnivnantides which may be generated upon degradation of the largest species. 

P y Tnf am^ acids which are at the junction of the vectored and COP-1 POlypeptdes are en^ed 
bv the 5 ? ™ igonucleotJde adaptor duplex. The adaptor duplex can be designed to encode a metone 
reside berween tne bacterial protein and the COM sequences. In this case, the COP-1 PO^P*** «n 
be Released from the fusion protein by treatment with cyanogen bromide which cleaves on the carboxy I 
SmS iSe of methionine residues. Both forms of COM (the fusions and the free polypeptides) are 

^ other fusion vectors can be employed to express COP-1 ^.P*™""; 

and other Segies can be emptoyed to release the COP-1 polypeptides from the bactenal prot eins. For 
e^S ^ to improve the expression of rCOM polypeptides in E. coli. genes codmg ^ ar*d 
rCC^- -i9 were subclonedfrom pREV 2.1 to P BG3-2AN. a plasmid used to express Protein A. PBS3-2AN 
ha? beU deposited as described* U.S. Patent No. 4.691.009. The deposit was made -« •J*™*' 
WB4 "d given the accession number of NRRL B-15910. The rCOP-1 genes were .solated from the pREV 
21 re«mbr a nt P .asmids by digestion with Nco1 and EcoRI. The Nco1 site occurs .n fte 5 Hnker 
«mnteved in cloning the rCOP-1 genes and theTToRI site is in pREV 2.1 downstream of the rCOP-1 gene. 
Se Mon Zl me restriction enzymes. 1^ ends of the rCOM genes are blunted w* deox- 
ySonSSes and K.enow fragment. DNA fragments containing the rCOP-1 •"-^'J**^ 
JrZ, n *» nels DBG3-2AN is digested wtih Nhe1. treated with phosphatase and the ends of the DNA are 
££c wift deoxyribonucleotides and Kleno-wTragment. After ligation and plating. pBG3-2*N recombinants . 

Lino acid sequences for rCOP-1-77 and rCOP-1-19 are shown In Figures « «• ' res^pe^ely^COPJ 

-a^se^ents: YKK.EAE. KAK AAK..and AAA. The N-terminal alanine residue In each sequence ,s left 
behind following CNBr cleavage of the fusion protein. 

The Invention should not be limited to the examples descnbed above. 

Example 5 - Expression of Random Sequence Gen s in Nonjusion Vectors 

Synthetic genes can be cloned Into expression vectors such that ft polypeptide products are not fused 


g 


# ............... % 
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to vector-derived protein sequences. As In fusion vectors, th se non-fusion vectors contain the appropnate 
transcriptional and translation* signals; however, the synthetic genes are linked directly adjacent to the 
translation initiation signal such that they contain a methionine residue as the amino-termlnal amino aad. 
TOs Se methionine can be removed from COM polypeptides by cyanogen bromide cleavage. COM 
polypeptides with and without this amino-terminal methionine are tested for biological actvity. 


Example 6 - Purification of rCOP-1 Polypeptides 

The purification of rCOP-1 polypeptides can be accomplished by a number of methods which are well 
knov^ toTose skilled in this art For example. E. coli celis expressing Protein MCOP-1 1us,on protems are 
orow^ in a fermenter. collected by csntrlfugrton. md lysed using a dynamill. The extract is centnfuged to 
remTve debris, adjured to contain 8 M urea, and chromatographed on an S-Sepharose column usmg a 
Sum chloride gradient for elution. Fractions containing the fusion protein are d.alyzed agamst a soUition 
ofaTyTerol in phosphate buffered saline; the dialysate is centrifuged to remove contam.nating P~te.n S that 
precipitate during dialysis. The Protein A/rCOP-1 fusion protein is cleaved with cyanogen brom.de and the 
rCOP-1 polypeptide is purified by gel filtration and reverse phase HPLC. 


Example 7 - EAE Experiments 

rCOP-1 has been tested for efficacy in suppressing experimental allergic encephalomyelitis (EAE). As 
described above. EAE is a T-cell mediated autoimmune disease that is employed as a model for the human 
disuse multiple sclerosis. EAE experiments are performed essentially as described by Swanborg 
fswanboro RH. [1988] "Experimental Allergic Encephalomyelitis." In Methods In Enzymology, .vol. 162. p. 
ST A^demic Press inc.). For example, the disease Is Induced in Hartley guinea p.gs by a angle. 
^ bcu^eoTs iniecS of 10 ug of guine'a pig myeiin basic protein in 

of Mycobacterium t uberculosis . Onset of disease occurs about 12 to 20 days after .nductlon. The disease is 
s Lred on ascalS of 0-4: 0 = no disease; 1 = toss of coordination In hind limbs: 2 = paralyse of one or 
behind limbs; 3 = paralysis extending to include one or both front limbs, can .nclude '"continence of 
bladder or b^wel; and 4 = extensive paralysis, inability to move. Animals are scored every 2-3 days from 
Se onse^ of Ssease. and most animais spontaneously recover from the disease. Tne treatmerrt protocol 
Tons^s of intramuscular Injections of 500 mg of test material at 1. 6. and 11 days afte, 
SeaTe ^e dosage, route of administration, and schedule for treatments can be vaned. Also. EAE 
experiments can be performed in other species including rats and mice. Other variations of the expenmen- 

" CSoSf ^ZZ^CP^ and rCOP-1-19. have recently been tested In the EAE experiments. 
The production and purification of these molecules were performed in accordance with the procedures 
descSbed ?n Examples 4 and 6. Guinea pigs treated with rCOP-1-77 or rC0P-1-19 we compared to 
SSSS^Hh my* basic protein, a positive centre,, and to an 

13 shows the average EAE scores for each treatment group versus days after Induction. In Table 1. several 
aspects of disease, such as incidence, day of onset, maximum severity, and duration, are compared. 

Table 1. 


Effects on rCOP-1 and MBP on EAE 

Group Duration 2 

Incidence 1 

Day of Onset 2 

Max. : 

Severity 2 

Untreated 

rCOP-1-19 

rCOP-1-77 

Myelin Basic Protein 

7/7 
8/8 
7/8 
8/8 

13.5 ±1.0 (13-15) 
16.8 t 3.7 (13-25) 

17.6 12.8 (15-22) 
18.1 12.6 (15-20) 

2.3 ± 0.6 (1-3) 

2.8 ±1.4 (1-4) 

2.9 ± 1.0 (1-4) 
2.3 ±1.1 (1-4) 

10.8*3.8(4-15) 
8.6 3 * 5.2 (2-15) 
9.9 a 4.5 (2-15) 
5.3 ± 5.5 (2-15) 


1 Incidence = numoer wim v.i^^~.,w.. 

2 Values ar the mean standard deviation. Ranges are in parentheses. 

3 One animal in this group died. mJ _ 


10 
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TS 


20 


25 


■ ... m « a ri,«H as follows- rCOP-1-77 and rC0P-M9 both delayed the onset of diseas 
The results can be summarized «J»™^^L ^ 0 P-1 molecules did not affect other measures of 
using the treatment reg.men described ^e^The rCOP 1 motecu.es did 

disease-incidence. "f™^^ about this panicuiar 

duration of djs ^^^J^^^^^ animals (2.3) is unusuai.y tow; in a pilot 
5 EES ^"iTS: ^severity scores were a and 4. t may be possibie to c P *n«e the 
effects of rCOP-1 by varying the treatment procedure. 

» Fv^nnte 8 ■ Other Applications for Random Sequence Polymers of Amino Acids 
. Gene^'encoding random sequence ^^J^AT^ 

sr;^^ to 

subsequent oxidation of cyste.ne rescues ^^^'^^IZedZ and neutralize these effects. 
Random sequence polypeptodes may be able to ^eract wrth ^agec. " ^ 

Polypeptides of different molecules may confer 

properties such as charge, solub.lity. and ab.lity to absorb "™»" •» f bi ^ 

beneficial effects on damaged hair such as ™.ng For example, a 

hair. The effects may vary depend.ng on the P^ 1 P™ pert Jj! * „°J£7ze *e negative charge of 

^rertoSr^ P0'ypep« d 03 having a predetermined amino acid compos,- 

tion is as supplements for diets deficient in certain amino acids. 

\ 

so Claims 

apt*, ol plowing. ««*«. « "^ l ^ i „ d0 ™ ^SSSort- ^«pl. .dm* 
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predetermined amino acid constituents, said method comprising the synthesis of g nes encoding said 
random polymers, said synthesis comprising the polymerization of small oligonucleotide duplexes, said 
method for making polypeptides further comprising the cloning of said synthetic genes into an expression 
vector and transferring said vector Into a bacteria or eukaryotic cell capable of expressing said polypeptide. 
15. A method, according to claim 14, wherein said bacteria is an Escherichia coll . 
16 A method for making a fusion polypeptide having one portion comprised of all or part of a 
heterologous polypeptide and a second portion comprised of a random sequence wherein said random 
sequence portion has predetermined amino acid constituents, and said method comprises the synthesis of 
genes encoding said random polymers, said synthesis comprising the polymerization of small 
oligonucleotide duplexes, said method further comprising the cloning of said synthetic genes into an 
expression vector adjacent to a DNA sequence encoding all or part of said heterologous polypeptide and 
transferring said vector into a bacteria or eukaryotic cell capable of expressing said fusion polypeptide. 

17. A method for synthesizing and identifying a biologically or Immunologically active polypeptide, or 
mixture of polypeptides, said method comprising the following steps: 

(a) synthesizing genes encoding random polypeptides by polymerization of small oligonucleotide 

duotexes* 

(b) cloning each of said synthetic genes into a vector and transferring said vector into a host capable 
of expressing said polypeptides; 

(c) growing said hosts under conditions which permit the formation of recombinant colonies which 
20 each express one recombinant gene; ^„ ~ 

(d) testing the polypeptide, or a mixture of the recombinant polypeptides, combined either before or 
after isolation from said colonies, for evidence of biological and/or immunological activity; 

(e) where activity is observed for the mixture of polypeptides, generating smaller subsets of that 
combination and testing each of these subsets for biological activity; and 

25 (f) repeating step (e) until the active components) or a suitable mixture thereof Is obtained. 

18. A synthetic gene which codes for rCOP-1-19. 

19. A synthetic gene which codes for rCOP-1-77. 
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Strategy for Synthesizing Random Genes and 
Identifying Recombinant Polypeptides with Biological Activity 


Figure 1 


mixture of 

oligonucleotide 

duplexes 


wvw 


ligate 



vAA/vV 

wvw 


random sequence 
synthetic genes 


wvw 


clone in an expression vector 
transform into host 


\ 


recombinant colonies, 
each contains one 
synthetic gene 


identification of biologically active polypeptides 

i 


v 

pool colonies (e.g.1000) 
y isolate polypeptides 
test lor biological activity 

fractionate into smaller pools of 
colonies (e.g. 10 pools of 100 
colonies) 


select individuaJ colonies 
isolate the polypeptide 
lest tor biological activity 
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Oligonucleotides 
(9-mers) 


anneal 


Duplexes 


llgate 


Synthetic 
Genes 



expression 


Polypeptides 


Ala-AI^AIa-Lys-^^^ 

Ala-Ala-Ala-AIa-A^^^ 

Lys -Lys-Lys-Ala-A^^^ 


.X 


Ala Ala Ala 
. Lys Lys Lys 
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Figure 3 


All Possible 3 Amino Acid Combinations 
and Their Percent Occurence in Cop 1 


AAA 

7.B72 

AEA 

2. £24 

AN A 

6 . 560 

AY A 

1 ,312 

EAA 

2.624 

EEA 

0 .875 

EKA 

2. 187 

t'l A 

0.HJ7 

KAA 

6.560 

KEA 

2. 1U7 

KKA 

5.466 

KYA. 

1 .093 

YAA 

1.31-2 

YEA 

0 .437 

YKA 

1 .093 

'i Y A 

0.219 


AAE 

2.624 

AEE 

0.875 

AKE 

2. 187 

AYE 

0 .437 

EAE 

0.875 

EEE 

0 .292 

EKE 

0 .729 

EYE 

0.146 

KAE 

2. 187 

KEE 

0.729 

KKE 

1 . B22 

KYE 

0.364 

YAE 

0,437 

YEE 

0.146 

YKE 

0.364 

YYE 

0 . 073 


AAK 

6 .560 

AEK 

2. 1B7 

AKK 

5.466 

AYK 

1 .093 

EAK 

2. 187 

EEK 

0 .729 

EKK 

1 .822 

EYK 

0.364 

KAK 

5.466 

KEK 

1 . 822 

KKK 

4.555 

KYK 

0.911 

YAK 

1 .093 

YEK 

0 .364 

YKK 

0.911 

YYK 

0. 182 


AAY 1.312 

AEY 0.437 

AKY 1.093 

AY Y 0.219 

EAY \ 0.437 

EEY 0.146 

EKY 0.364 

EYY 0.073 

KAY 1.093 

KEY 0.364 

KKY 0.911 

KYY 0.1 B2 

YAY 0.219 

YEY 0.073 

YKY 0.1Q2 

YYY 0.036 


A=AIanine 
E=Glutamic Acid 
K=Lysine 
Y=Tyrosine 
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Figure 4 

9-Nucleotide Duplexes and Adaptors 
for Cop 1 Gene Synthesis 


Examples of 9-Nucleotide Duplexes for Cop 1 Genes 

• _ _ _ . . 


I AnAAPRPA 6AAGCAGAA (3* end) 
Coding (5' end ) AAGAAGGCA » 

G TCTTCGTCT (5 end) 

CP) ® 
E A E 


Noncoding (3'end)TTTCTTCC6 
Amino acids K K 


Adaptors: 


(3) 


i ■ GATC 
For the 5' end ^ 


(BamH I) ~^J) 


For the 3' end T AQCT - 


(Sac I) 


EP 0 383 620 A2 


Figure 5 

Construction and Cloning 
Synthetic COP 1 Genes 


m 


5' adaptor 


GATC- 

* - 

(BamH I) 


(OH) 


GATC- 
(BamH I) 


ft 


9-mers 


3' adaptor 


■ AGCT 


ligase 


(Sac I) 


•AGCT, 


(Sac I) 


size-select genes 


clone into vector 


GATC- 


:~agct 



Synthetic Gene 


Plasmid Vector 
cut with BamH \ 
and Sac I 
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Figure 6 

Sequence Analysis of a 
Synthetic Gene 


v w KE A E K ■ A K 
amino acids: Y K k fc m 

nudeotides: T A C A A G A A a|g A A G C A G A a|* A G G C T A A A | 

amino acids: Y KKY K K K K A 
nucleotides: TACAAGAA a|t A C A A G A A Ap A G A AGGCAj 

amino acids: e A E K K AE 


nucleotides: G 


aagcagaa|a.agaaggca|3aagcagaa| 

amino acids: K K A E A E E a 
nucleotides: AAGAAGGC a|g A A G C A G A Ap A AGCAGAAj 

amino acids: K K A K A K E A E 
nucleotfdes: AAGAAGGC a|a AGGCTAAa|bAAGCAGAA J 

amino acids: K K A K K . A K A K 
nucleotides: A A G A A G G C a|a A G A A G G C a|*. A G G CT A A A | 


The lines in the nucleotide sequence mark the Junctions 
between 9-mer duplexes. 


Amino Acids 
A= alanine 
K=lysine 
E=gluumic acid 
V=tyrosinc 
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Figure 7 


Cop1 
Polypeptides 


T3 
C 

2 

I 

m 
t. 
o 

> 



CO 
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Figure 8 

Variations of Random-Gene Synthesis 
using Small Duplexes 


One Nucleotide Extension: 

n morc GCA GAA 
3-mers jcG TCT 


GCA AAA 6AAGCA 
6-mers TCGTTT TCTTCG 


GCAGAAGCAAAA 
12-mers TCGTCTTCGTTT 


\ 


Three Nucleotide Extensions. 12-mer Duplexes: 


duplex XXXXXXXXXGCA 

CGTXXXXXXXXX 


ligate 


none XXXXXXXXXGCAXXXXXXXXXGCAXXXXXXXXXGCA 
9 CGTXXXXXXXXXCGTXXXXXXXXXCGTXXXXXXXXX 


polypeptide Ala aa aa aa Ala aa aa aa Ala aa aa aaa Ala 


xxx» any codon 

aa=» amino acid sp cified 

by codon xxx 
AIa» alanine 
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Figure 9 

Synthesis of Single-Stranded DNA Encoding 
Random Sequence Amino Acid Polymers 


3 Nucleotide Segments for COP 1 

Ala 


ff GCA 3 ' 


ratio: 


Glu 
5 GAA 3 


Lys 


5 AAA' 
5 


3' 


5 TAC 3 


Polymerize 

A. Solution Chemistry 
or B. DNA Synthesizer 


Single-Stranded DNA 


5 f < 


Double-Stranded DNA 


5' 
3' 


-3' 


Enzymatic Synthesis 
of the Second Strand 


.3' 
.5' 


Clone into 
Expression Vector 
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Figure 10 


DMTr 
I 

O 
I 

CH 


Phosphoramldlte Trinucleotide for Random 
Gene Synthesis Using a DNA Synthesizer 


;H p ?1 

"? 


o=p— o 


ch 2 ?: 


o=p— o 


Phosphoramidite 
Trinucleotide 


CH 2 P; 


CN -CH 2-CH 2O N ""CH (CH 3) 2 
CH (CH3)2 


Phosphoramidite 
Mononucleotide 


DMTjr 

? 

CH 2 


CH30 fv| — CH(CH3)2 
CH(CH3)2 
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FIGURE 11 
rCOP-1-77 


K K 


E V K K Y K K Y K K E 


EAEAAKAAKAAAAAA** 

+ - + :7-:::__ fn ^ mrCGA TTTATGTTCTTTTTCC^ATTTCTT 


TTTCTTCGTCT' 
X 


TCGACGACGTCTTCGTCTTTTCCC 
EAAAEAEKA^ 


* E * X K X A K A A A E A E * * * * > E 
CXAC^AATACAACAAAXACAA^-CX^J^A^A^^ 

, , » «■ K K I i' « « * * * * * * K K K * 

^caascag^ctaaaoct^^ 


X E A E K 


K A A A E A 


X A K E 


j^g^aatacaagaaagaagcagaaaaggctaaagaagcagaataa 

TTCT^ATG^CTTTCTCCGTCTTTTCCGATTTCTTCGTCTTATT 
EAEKAKEAE * 


X K 
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FIGURE 12 
rCOP-1-19 



TCGTTTC CGACGTTTCTTCCGT. 

CGTTTCCGACGTCTCj. J.TCGTU I . . r> K A K - 

„ -kakaakxaysa- 

A K A ... gaj^GCAGAGAAAGCAGAGAAAGCAGCTGCTGAA 

"^^T^TCTTCCGTCTCTTTCGTCTCTTTCGTCTCTrrc 

CG ATTTATG CTT CGTTTCTu. <- £ 

vaekaekaek*-"- 

A K Y E A K ,-(-s.AGCAGAGAAAGGAAAGGAA 



cs tctctttcStccticgittcitcc C iti=« 

A E X * K ******** E " A 


+ ""Zj arGTCTCTTTCGTTTCCGACGTATGTTCTTTAJ. 

TTCCGTCTCTTTCGTTTCCGACGTCTCTTT ^ 

^^KAAEKAKAAYK R 

GCGAAAGCAAAGGCTGCATAA 
• CGCTTTCGTTTCCGACGTATT 



